Recurrent Affine Transform Encoder for Image Representation
Authors
Abstract
This paper proposes a Recurrent Affine Transform Encoder (RATE) that can be used for image representation learning. We propose a learning architecture that enables a CNN encoder to learn the affine transform parameter of images. The proposed architecture decomposes an affine transform matrix into two matrices and learns them jointly in a self-supervised manner. RATE is trained on unlabeled data without any ground truth and infers the affine transform parameter of input images recurrently. The inferred parameter can be used to represent images in a canonical form, greatly reducing variations caused by affine transforms such as rotation, scaling, and translation. Unlike the spatial transformer network, RATE does not need to be embedded into other networks for training with the aid of other objectives. We show that RATE achieves impressive results in terms of invariance to translation, scaling, and rotation. We also show that classification performance is enhanced and more robust against distortion when RATE is incorporated into an existing model.
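The decomposition and the recurrent inference described in the abstract can be illustrated with a small NumPy sketch. The abstract does not say which two matrices RATE uses, so the split below (a rotation-and-scale factor and a translation factor) is an assumption made purely for illustration. All function names are hypothetical. The two correction steps are computed analytically here, whereas in the paper a CNN encoder would predict them.

```python
import numpy as np

def linear_part(theta, scale):
    """3x3 homogeneous matrix: rotation by theta (radians) with uniform scaling."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[scale * c, -scale * s, 0.0],
                     [scale * s,  scale * c, 0.0],
                     [0.0, 0.0, 1.0]])

def translation_part(tx, ty):
    """3x3 homogeneous matrix: translation by (tx, ty)."""
    return np.array([[1.0, 0.0, tx],
                     [0.0, 1.0, ty],
                     [0.0, 0.0, 1.0]])

def compose(theta, scale, tx, ty):
    """Affine matrix as a product of two factors (assumed split, not from the paper)."""
    # Rotate and scale first, then translate.
    return translation_part(tx, ty) @ linear_part(theta, scale)

def apply_affine(matrix, points):
    """Apply a 3x3 homogeneous matrix to an (N, 2) array of points."""
    homogeneous = np.hstack([points, np.ones((len(points), 1))])
    return (homogeneous @ matrix.T)[:, :2]

def accumulate(step_matrices):
    """Fold per-step estimates into one matrix; later steps act after earlier ones."""
    total = np.eye(3)
    for step in step_matrices:
        total = step @ total
    return total

# Canonical points and a distorted observation of them.
canonical = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
distortion = compose(theta=np.pi / 6, scale=1.5, tx=2.0, ty=-1.0)
observed = apply_affine(distortion, canonical)

# Two hypothetical recurrent steps: undo the translation, then the rotation/scale.
step1 = np.linalg.inv(translation_part(2.0, -1.0))
step2 = np.linalg.inv(linear_part(np.pi / 6, 1.5))
recovered = apply_affine(accumulate([step1, step2]), observed)
```

In this sketch `recovered` matches `canonical` up to floating-point error, which is the sense in which an inferred affine parameter maps an input back to a canonical form.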
Similar resources
Affine transform resilient image fingerprinting
Affine transformations are a well-known robustness issue in many multimedia fingerprinting systems. Since it is quite easy with modern computers to apply affine transformations to audio, image and video content, there is an obvious necessity for affine transformation resilient fingerprinting. In this paper we present a new method for affine transformation resilient fingerprints that is based upo...
The finite ridgelet transform for image representation
The ridgelet transform was introduced as a sparse expansion for functions on continuous spaces that are smooth away from discontinuities along lines. We propose an orthonormal version of the ridgelet transform for discrete and finite-size images. Our construction uses the finite Radon transform (FRAT) as a building block. To overcome the periodization effect of a finite transform, we introduce ...
Image Representation Via a Finite Radon Transform
This paper presents a model of finite Radon transforms composed of Radon projections. The model generalizes to finite groups projections in the classical Radon transform theory. The Radon projector averages a function on a group over cosets of a subgroup. Reconstruction formulae formally similar to the convolved backprojection ones are derived and an iterative reconstruction technique is found to...
Astronomical image representation by the curvelet transform
We outline digital implementations of two newly developed multiscale representation systems, namely, the ridgelet and curvelet transforms. We apply these digital transforms to the problem of restoring an image from noisy data and compare our results with those obtained via well established methods based on the thresholding of wavelet coefficients. We show that the curvelet transform allows us a...
A New Affine Invariant Image Transform Based on Ridgelets
In this paper we present a new affine invariant image transform, based on ridgelets. The proposed transform is directly applicable to segmented image patches. The new method has some similarities with the previously proposed Multiscale Autoconvolution, but it will offer a more general framework and possibilities for variations. The obtained transform coefficients can be used in affine invariant...
Journal
Journal title: IEEE Access
Year: 2022
ISSN: 2169-3536
DOI: https://doi.org/10.1109/access.2022.3150340